Expected Loss Optimization for Document Ranking by Active Learning
نویسندگان
چکیده
Learning to rank is the emerging research field in many data mining applications and information retrieval techniques (e.g. Search engines). The major issue in ranking algorithm is that the quality or ranking is affected by labeled examples, since it is very expensive and also time consuming to collect labeled samples. This problem brings a great need for active learning algorithm; however, in literature learning to rank uses supervised learning algorithm where ranking is based on labeled data only. A general active learning framework Balanced two stage Expected Loss Optimization is proposed to select the most informative document based on user’s query. The algorithm is based on two levels, Query level and Document level and grade distribution is done based on query and document pairs. Experiment on web search dataset has demonstrated with the proposed algorithm.
منابع مشابه
On Efficient Heuristic Ranking of Hypotheses
This paper considers the problem of learning the ranking of a set of alternatives based upon incomplete information (e.g., a limited number of observations). We describe two algorithms for hypothesis ranking and their application for probably approximately correct (PAC) and expected loss (EL) learning criteria. Empirical results are provided to demonstrate the effectiveness of these ranking pro...
متن کاملRRLUFF: Ranking function based on Reinforcement Learning using User Feedback and Web Document Features
Principal aim of a search engine is to provide the sorted results according to user’s requirements. To achieve this aim, it employs ranking methods to rank the web documents based on their significance and relevance to user query. The novelty of this paper is to provide user feedback-based ranking algorithm using reinforcement learning. The proposed algorithm is called RRLUFF, in which the rank...
متن کاملA Meta-Learning Approach for Robust Rank Learning
Learning effective feature-based ranking functions is a fundamental task for search engines, and has recently become an active area of research [10, 3, 2]. Many of these recent algorithms are based on the pairwise preference framework, in which instead of taking documents in isolation, document pairs are used as instances in the learning process. One disadvantage of this process is that a noisy...
متن کاملWeb pages ranking algorithm based on reinforcement learning and user feedback
The main challenge of a search engine is ranking web documents to provide the best response to a user`s query. Despite the huge number of the extracted results for user`s query, only a small number of the first results are examined by users; therefore, the insertion of the related results in the first ranks is of great importance. In this paper, a ranking algorithm based on the reinforcement le...
متن کاملConvergence to Global Optimality with Sequential Bayesian Sampling Policies
We consider Bayesian information collection, in which a measurement policy collects information to support a future decision. This framework includes problems in ranking and selection, reinforcement learning, and continuous global optimization. We give sufficient conditions under which measurement policies achieve asymptotically minimal expected loss. Achieving asymptotically minimal expected l...
متن کامل